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Abstract 

We consider the following special case of minimizing makespan. A set of jobs J and 
a set of machines M are given. Each job j € J can be scheduled on a machine from a 
subset Mj of M. The processing time of j is the same on all machines in Mj. The jobs 
are of two sizes, namely b (big) and s (small). We present a polynomial-time algorithm 
that approximates the value of the optimal makespan within a factor of 1.883 and some 
further improvements when every job can be scheduled on at most two machines. 

Keywords: approximation algorithms, scheduling, makespan minimization, graph bal- 
ancing. 

1 Introduction 

The problem we consider is a special case of makespan minimization, i.e., the problem of 
scheduling a set of jobs J on a set of machines M with the objective of minimizing the 
maximum machine load. In the most general case of unrelated machines, each job j £ J has 
a processing time pij on machine i S M. An LP-rounding algorithm with an approximation 
factor of 2 along with a proof of 3/2- hardness, unless P = NP, are two classic results of 
[3]. For more than 20 years, no progress has been made either on the approximation of the 
general case or on the lower bound. 

The 3/2 lower bound holds also for the case with assignment constraints. In this setting, 
each job j can be scheduled on any machine from a subset Mj of M. The processing time pj 
of j is the same for any machine in Mj . For this case it was recently shown in [6] that a strong 
LP-relaxation called the configuration LP has an integrality gap of at most 33/17. The proof 
of this exciting result is non-constructive, in the sense that it does not actually provide a 
polynomial-time algorithm that finds such a solution. The best known polynomial-time 
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algorithm that computes a near-optimal schedule is still the 2-approximation algorithm of 
[3]. A slight improvement to 2 — 1/|M| was given by [5]. 

The makespan minimization problem is one of the most important problems in scheduling. 
Advancement on the approximation of the unrelated case appears as one of the top 10 open 
problems in approximation algorithms in the listing of [8]. Due to the importance of the 
problem several special cases have been considered, where further restrictions are imposed 
either on the sets Mj, or on the domain of the processing times. 

In the case of graph balancing each job can be scheduled on at most 2 machines. One can 
interpret this as the following problem: we are given an undirected graph with weighted 
edges and we are asked to direct the edges towards the nodes so as to minimize the maximum 
weighted in-degree over all nodes. An LP-rounding algorithm achieving an approximation 
factor of 1.75 was given in [2]. The LP used is that of [3] with the addition of a valid 
set of inequalities. The considered LP has an integrality gap of 2 when we allow jobs 
that can be scheduled on 3 machines |6j. The case of graph balancing with no parallel 
edges and with integer edge weights had been previously considered in [1] under the name 
graph orientation. Among other results pQ gave a combinatorial (2 — ^^-j-)-approximation 
algorithm when weights are from the set {l,k}. 

The natural case when jobs can be either "big" or "small", i.e., the processing times 
can only take one of two values gives rise to particularly difficult instances. The unrelated 
version of graph balancing with only 2 distinct edge weights seems to capture a major 
portion of the difficulty of the general scheduling problem on unrelated machines. It was 
recently shown in [7] that even when there are 2 distinct edge weights and an edge may 
have different weights for each endpoint, the configuration LP has an integrality gap of 2. 
For scheduling with assignment constraints, 3/2-hardness holds also for the case when there 
are only 2 distinct processing time values [1] and the integrality gap of the natural LP of 
@1 is still 2|. 

Motivated by the above, we study the 2-valued case of minimizing makespan with as- 
signment constraints, where jobs are of two sizes, namely b (big) and s (small). We present 
a simple algorithm for the case when the job sizes are 1 and k G N which approximates 
the value of the optimal solution within a ratio of 2 — r and then we show how to use this 
algorithm as a black-box subroutine to get an approximation when the job sizes are non- 
negative real numbers. Combining our algorithm with the non-constructive result of [6] we 
get an improved non-constructive approximation guarantee of 1.883 for the 2-valued case 
of makespan with assignment constraints. The same algorithm can be used to obtain an 
efficient 1.652 approximation for the 2-valued graph balancing, where additionally for every 
j we have \Mj\ < 2, improving and generalizing the result of [Tj. Note that even for the 
2-valued graph balancing it remains NP-hard to compute a better than 3/2-approximate 
solution [H [2] . 

2 An algorithm for the case with job sizes from the set {1, k} 

This section describes a subroutine which will be used in our algorithms. Apart from a few 
details, the network construction and the rounding argument follow from [3]. 
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Similar to [1] , our algorithm takes as a parameter an estimation T of the makespan of 
the optimal solution. Using binary search, we find the smallest T for which our algorithm 
returns a (possibly fractional) solution. As we will show, that T is a lower bound to the 
makespan of the optimal solution. 

Here and in the following sections we make the assumption that the optimal makespan 
is less than twice the size of the biggest job b. This is w.l.o.g. since the algorithm of [3] 
returns a solution with makespan at most T* + b, where T* is the optimum of the natural 
LP relaxation for the problem. This is formalized in the following lemma. 

Lemma 2.1 If the optimal makespan to an instance of the problem is Topt > 2b, where 
b is the biggest job size, then there is 3/ 2- approximation algorithm. 

The algorithm we describe is used for instances where the sizes of the jobs are integers, 
either 1 or A; > 1. Given such an instance of the problem, we construct the following 
multi-level network. On the first level we have a single source s. On the next level we 
have one node rij for each job j. For each such node rij we have a directed edge from s to 
rij of capacity equal to the size of job j. On the level after that, we have a node Vij, for 
each machine i. This type of nodes is not actually representing the set of machines. Their 
purpose is to limit the amount of flow from big jobs that may enter the machine nodes of 
the next level, enforcing the valid inequality of [2] in our solution. We have a a directed 
edge of capacity k from a node rij to a node vi^ iff j is a big job that can be scheduled to 
machine i. On the next level we have one node mi for each machine i. We have a directed 
edge of capacity 1 from each node nj, where job j is a small job, to a node m« iff job j 
can be scheduled on i. We also have one directed edge of capacity k from Vij, to mj. We 
connect each machine node mi to the sink t at the last level using edges of capacity T, 
where T is our estimation for the makespan. See Fig. [T] for an example. Observe that while 
the flow from small job nodes is routed directly to the machine nodes, the flow from the big 
job nodes is routed through the nodes Vi^ and then to the machine nodes. Thus at most k 
units of flow from big job nodes enter each machine node. 

A feasible flow solution is one that sends a total amount of flow equal to the sum of 
the processing times of the jobs. Using binary search we find the minimum value of T 
for which such a feasible solution exists. This is a lower bound on the optimum. To see 
why this is true, consider a solution S to the problem instance with makespan T. We 
construct the following network flow solution for the network corresponding to the instance 
with estimation T or higher: for each job j, we send from s to rij flow equal to the size of 
j. If j is a small job assigned to machine i by S, then we send 1 unit of flow from rij to 
mi. If j is a big job assigned to machine i by S, then we send k units of flow from rij to 
Vi t b and then k units to m; from Vi t b (there is at most one such big job for each machine - 



recall that because of Lemma 2.1 we consider only instances with T < 2b). We send from 
each node mi to the sink t all amount of flow that has received, which is at most T. We 
interpret the network solution naturally as a fractional assignment: if a fraction / of the 
flow that source s sends to node rij is routed through the machine node mi then we assign 
a fraction / of job j to machine i. 

After solving the network flow problem we obtain a fractional assignment S in which 
small jobs are integrally assigned and for every big job j that has a non-zero fraction Xij 
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Figure 1: Example network construction for an instance with 4 jobs and 2 machines. Edges 
have unit capacity unless indicated otherwise. 

assigned to machine % 6 Mj, X{j > 1/k. In particular by flow integrality, if xu > 0, then Xij 
will be an integral multiple of 1/k. 

The next lemma shows that we can assign each big job to a machine that has a fraction 
of at least 1/k of it, so that each machine gets at most one big job. The proof of the lemma 
is easy and is omitted. 

Lemma 2.2 Let Jj, be the set of big jobs and S(Jb) be the set of machines that have a 
non-zero fraction of a big job. We can find in polynomial time a feasible schedule of the 
jobs of Jb to the machines of S(Jb) so that at most 1 big job is assigned to each machine. 

The value of the optimal solution is at least k, the size of a big job. By Lemma |2.2| in 
our solution we increased the load of a machine by at most k — 1 . This results in a 2 — 1 / k 
approximation. 

Proposition 2.1 The solution we get for an instance of the 2-valued makespan problem 
with job sizes from the set {l,k} by rounding the fractional assignment S resulting from the 
network flow solution, has an approximation ratio of 2 — 1/k. 



3 The Case With Arbitrary Job Sizes 



In this section we will use the algorithm of Proposition 2.1 to design an approximation 
algorithm for the more general case where the two job sizes are arbitrary nonnegative real 
numbers. Without loss of generality we assume that 6=1. We can easily normalize the 
job sizes by dividing them by b. We also assume that the size of small jobs is 1/a for some 
a > 1. We can reduce this problem to the previous one by changing the size of small jobs 
to either l/|~af| or l/[aj and use the previous algorithm, since in the modified instance 
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the size of the big jobs is an integer multiple of the size of the small jobs. We will use 
the resulting assignment as a solution to our original instance. Below we prove by doing 
so that the approximation ratio is bounded by certain expressions. Later we will combine 
those expressions with the result of |6] to get an improved non-constructive approximation 
guarantee. 



Definition 3.1 Let I be the original instance of our problem with job sizes in {l/a,l}, 
a > 1. Let 1% be the instance where we change the small job sizes to l/fa] and L2 be the 
instance resulting from changing the small job sizes to l/[aj • Additionally let f\ = \a] /a be 
the factor by which the small jobs in I are greater than the small jobs in I\ , and fi = [a\ / a 
be the factor by which the small jobs in I are smaller than the small jobs in I2. 



We proceed by bounding the approximation ratio in the case we use the solution of 
instance I\ as a solution to /. We focus on the load of some machine i. Let Opt\ be the 
cost of the optimal solution of instance I\ and Opt be the cost of the optimal solution 
for the original instance I. In the fractional assignment resulting from the solution of the 
network flow for instance I±, let B\ be the total fraction of big jobs that are assigned to i. 
If B\ > then B\ > 1/ \a\ by flow integrality. Then the load of i due to small jobs is at 
most Opt\ — B\. After the rounding the load of i is at most 1 + Opt\ — B\ where the load 
due to big jobs is 1 and the load due to small jobs is Opt\ — B\. If B\ = 0, only small jobs 
are assigned to machine i, then the load of i does not increase during the rounding and thus 
the total load of i is at most Opt\. 

If we use the forementioned solution of I\ as a solution to /, since the small jobs in 
/ are f\ times greater, the load of a machine i that is assigned a big job will be at most 
l + {Opt\ — B\)f\. Also we have that Opt > Opt\. So, regarding the constructed solution for 
the original instance I, the ratio of the load of i to the optimum makespan is 1 +(°p^~- Bi )/ 1 < 
i+(Opti-Bi)/i ^ gj nce ^ > and Opt\ > 1 we upper bound the former ratio once more 

(setting B\ = l/\a] and Opt\ = 1) by 1 + /1 — 1/a . If i is assigned only small jobs, then 
the corresponding load is at most Opt\f\ in instance / and the corresponding ratio is at 
most f\. Clearly 1 + f\ — l/a>/i, since a > 1. Thus we can bound the approximation 
ratio achieved using the solution of I\ by the ratio of the first case: 1 + f\ — 1/a. 

Now we will make a similar analysis for the case we use the solution of I2 as a solution 
to /. Let Opt2 be the value of the optimal solution of I2 (Opt2 > 1). We once again 
focus on the load of a single machine i. Using the same reasoning as above, if i was 
assigned a non-zero fraction B2 of big jobs, we have B2 > l/|_ a J by flow integrality. The 
load of i after rounding the fractional assignment is at most 1 + {Opt2 — B2). The small 
jobs of I have size /2 times the size they have in I2. So the load of i for instance / is 
at most 1 + (Opt2 — i?2)/2 • As for the value Opt of the optimal solution of /, we know 
that Opt > 0^2/2 (since the optimal solution to / with cost Opt induces a solution to 
I2 which is at most Opt/ f2 )■ Thus the ratio of the contructed solution to the optimal 
cost is 1 +(°p^~- 52 )/ 2 < 1+ ^°Qpl~^ 2 ^ 2 ■ Like before, we upper bound the former expression 
by setting Opt2 = 1 and B2 = 1 / [a\ and we get I//2 + 1 — 1/ |_ Q J • If i is assigned 
only small jobs, then the load does not increase during the rounding and is at most Opt2 
regarding instance 12- The corresponding load of i in the solution for instance I is at 
most 0^2/2- The ratio of this case is Opt2f2/Opt < 0^2/2/0^2/2 = 1- Since a > 1, 



5 



I//2 + 1 — l/L a J = a /L Q J + 1 — VL Q J > 1- So, once again, the expression that bounds the 
approximation ratio achieved is I//2 + 1 — 1/ [cx\ . 

We use the solution of the instance that achieves the minimum approximation ratio. We 
have thus proved the following theorem: 

Theorem 3.1 For any instance of the 2-valued makespan with assignment constraints and 
with job sizes in {1,1/ a} , a > 1, we can find in polynomial time a solution which has an 
approximation ratio min{l + fi — 1/a, I//2 + 1 — l/L a J} 

The approximation we have achieved so far depends on a. We can calculate the worst 
case approximation given by the above theorem for interval (n, n + 1) which contains a, by 
setting the two expressions to be equal, since one is decreasing and the other is increasing 
in a, for a given interval (n, n + 1). 

According to the above, the worst approximation achieved for a £ (n, n + 1) is for the 
value of a for which: 

l + /i-l/a = l// 2 + l-l/|aJ 
1 + =±i _ i/ a = & + 1 _ i/ n 

a 2 — a — n 2 = 
The solution is a G { Wi+4n2 ? l+V W | n (n,n + 1). 

The above calculation gives an approximation guarantee of 1.883 for values of a up to 5. 
For values of a > 5 the small job size is 1/a < 0.2 and therefore it is preferable to use the 
following: 

Theorem 3.2 |6j/ If an instance of the scheduling problem only has job sizes s > and 1, 
then the configuration LP has integrality gap at most 5/3 + s . 

The above theorem was actually proved in [6] with the assumption that T = 1. The 
proof, however, can be easily generalized to the case where T < 2 by making some minor 
changes. The main idea is changing 1 to T wherever the proof refers to the makespan (i.e., 
the integrality gap becomes (T + 2/3 + s)/T). Note that for a > 5 the approximation of 
the above theorem is at most 5/3 + 0.2 < 1.883. So the following has been proved: 

Theorem 3.3 We can estimate the optimal makespan of a 2-valued instance with assign- 
ment constraints and with jobs of two sizes within a factor of 1.883 in polynomial time. 

4 2-valued case of graph balancing 

In this section we present a 1.652- approximation for graph balancing instances, where the 
edge weights belong to the set {b, s}. Our proof relies on a modification of the proof for the 
more general case of 2-valued makespan with assignment constraints. We will first prove a 
tight 3/2 approximation when b = k, s = 1, k £ N. 
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Let us consider the multi-level network denned above for this case where the graph edges 
take the role of jobs and the nodes take the role of machines. The key difference here is 
that for each job node there are at most 2 flow paths to the set of machine nodes. So in 
the fractional solution resulting from the flow solution, for each big job j there are at most 
2 machines that have nonzero fraction of that job. Moreover for each big job either there 
is one machine with a fraction greater than 1/2 of j, or there are 2 machines with exactly 
a 1/2 fraction of j each. This is due to the constraint enforced by nodes that each 
machine may take a total fraction of big jobs of at most 1. The small jobs are assigned 
integrally due to integrality of flow. 

Now we show how to round the fractional assignment: for each big job j for which 
there is a machine i with a fraction greater than 1/2 of j assigned to it, assign j to i. 
As for the big jobs that are assigned to 2 machines with a fraction of exactly 1/2, we 
consider the graph induced by the corresponding edges (only for those big jobs) and the 
nodes(machines) covered by those edges. Since each node has degree at most 2 (otherwise a 
machine is assigned at least 3/2 big jobs, which cannot happen), the graph is a collection of 
disjoint paths and cycles. It is easy to find a 1 — 1 assignment of the edges to the nodes (i.e., 
make a clockwise assignment on cycles, direct each path arbitrary and assign each edge to 
its head). 

Note that in the resulting assignment, each node-machine which had a fraction at least 
1/2 of some big edge-job may end up taking the whole edge-job. We have increased the 
cost of the fractional solution by 1/2 at most and since the cost of the optimal solution is 
at least 1, we have achieved a 3/2 approximation, matching the lower bound. 

Theorem 4.1 For instances of graph balancing in which b = k, s = 1, k € N, we can find 
in polynomial time a 3/ '2- approximate solution. 

As in the previous section, we use the above algorithm as a black box to solve the case 
with edge weights in {b, s} or w.l.o.g. in {1, 1/a}. Following the exact same argumentation, 
we can reduce this case to the previous one, by rounding the small job size to either I/fa] 
or to 1/LaJ- For both cases, the worst approximation ratio arise again from the case where 
machine i has a non-zero fraction of big jobs and received a whole big job after the rounding. 
Now we know that i had at least 1/2 fraction of some big job. The expressions giving the 
approximation in each case are 1 + /i/2 and I//2 + 1/2 respectively and are obtained in 
the same manner as in Section |3j Once again we balance the expressions in each interval 
(n,n+ 1) getting the following: 

1 + A/2 = l// 2 + 1/2 o 
2a 2 — na—(n + l)n = 

Assume a > 2. Solving the above equation for the intervals, the worst approximation 
achieved is when a S (2, 3), which is less than 1.652. 

When a £ (1, 2) we use a different approach: in this case s = 1/a > 1/2. If Opt = 1 then 
it is obvious that we can find an optimal solution via bipartite maching. If Opt > 1, then 
we know that Opt > 2s, since there must be a machine with at least 2 jobs assigned in each 
solution. Consider the case s > 0.65. If Opt > 2s then Opt > 1 + s > 1.65 (since we have 
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either 1 big job and 1 small in some machine or we have more than 3 small jobs in some 
machine) and the algorithm of [I] gives a 1 ^"gg 5 < 1.652 approximation. If Opt = 2s using, 
e.g., the LP and the cycle canceling of [2 J we get a fractional solution of cost at most Opt, 
for which the corresponding graph induced on the fractionally assigned nodes (machines) 
is a forest. Consider a tree t, and further consider a leaf node I. The load of the integrally 
assigned jobs of I can either be 1 or s (I cannot have 2 integrally assigned jobs and one 
fractionally since Opt = 2s). We assign the fractional job to I, resulting to a total load of 
at most 2. We do the same in a bottom-up manner for each node of the tree. The resulting 
approximation is 2/2s which is less than 1.652 for s > 0.65. If s < 0.65 then the reduction 
to I\ done previously gives an approximation of at most 1 + fi/2 = l + ^ = l + s< 1.652. 
Note that, for all the above, we do not need to know the value of Opt, we just keep the best 
solution of the mentioned approaches. 

Theorem 4.2 For instances of graph balancing in which the edge weights belong to the set 
{b,s} we can compute in polynomial time a 1.652 -approximate solution. 
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